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WHAT IS CLAIMED IS: 

£2pCj^^ . A m^thgd of determining a title from a document image, comprising: 
5 dividing the^bcument image into minimal circumscribing rectangles which 

contain a character image^ 

recognizing characters in said minimal circumscribing rectangles; and 
determining a title oXthe document image based upon a likelihood of each of said 
minimal circumscribing rectangles containing a title, said likelihood being determined 
1 0 based upon information obtainedVluring said character recognition. 

2. The method of determining a title from a document image according to claim 1 
wherein said likelihood is expressedVin a sum of points based said information. 

15 3. The method of determinim; a title from a document image according to claim 2 

wherein said information includes characteristics on font. 
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4. The method of determining a title from a document image according to claim 2 
wherein said font characteristics is determined on a frequency of a particular font type . 

5. The method of determining a title from a document image according to claim 2 
wherein said character recognition further includes an act of matching said characters with 
a set of predetermined words, said predetermined words indicating said title. 



2 5 *£?£<^ *"y6. The method MtJ^termining a title from a document image according to claim 5 
wherein said information includes a result of said matching with said predetermined words. 



7. The method of 
wherein said information includes 
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determi^iing a title from a document image according to claim 2 
number of said characters. 
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8. The method of determining a title from a document image according to claim 7 
wherein said number of said characters is compared to a predetermined maximal threshold 
number. 

^P^J^^' The method of detern^ning a title from a document image according to claim 2 
wherein said information includes eSi assurance level of said character recognition. 

10. The method of determining a title from a document image according to claim 
10 9 wherein said assurance level is compared to a predetermined minimal threshold value. 



■^S^ 1 1 . The metfifc^of determining a title from a document image according to claim 
2 wherein said informationuncludes layout characteristics. 

15 12. The method of deiermining a title from a document image according to claim 

1 1 wherein said information includes centering, underlining and size. 

13. The method of determining a title from a document image according to claim 
2 wherein said information indicates whether or not said characters end in a noun form. 
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14. The method of determining a title from a document image according to claim 
2 wherein said information indicates whether or not said characters end in a set of 
predetermined suffixes. 

2 5 15. The method of determining a title from a document image according to claim 



2 wherein said information inclu< 
circumscribing rectangles. 



es a ratio between a length and a height of each of said 
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16. The method of determining a title from a document image according to claim 
2 wherein said information incliSKies a ratio between a summed width of said characters and 
a corresponding one of said circumscribing rectangles. 

17. The method of determining a title from a document image according to claim 
1 wherein said likelihood is adjusted according to a type of said image documents. 
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18. The method of determining a title from a document image according to claim 
1 wherein said title is combined with a keyword. 




19. A system fds^termining a title from a document image, comprising: 
a character row ar^determination unit for dividing the document image into 
minimal circumscribing rectangles which contain a character image; 

a character recognition ukut connected to said character row area determination 
unit for recognizing characters in saM minimal circumscribing rectangles; and 

a title evaluation point determination unit connected to said character recognition 
unit for determining a title of the document image based upon a likelihood of each of said 
minimal circumscribing rectangles containing a title, said likelihood being determined 
based upon information obtained during said character recognition. 



20. The system for determining a 
1 9 wherein said title evaluation point 
terms of a sum of points based said inform; 



itle from a document image according to claim 
detenhination unit determines said likelihood in 
ition. 



25 21 . The system for determining i title from a document image according to claim 

20 wherein said title evaluation point determination unit further comprises a font 
determination unit for generating infomiation on font characteristics. 
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22. The system for determining a title from a document image according to claim 
21 wherein said font determination unit determines said font characteristics based on a 
frequency of a particular font type. 

5 23. The system for determining a title from a document image according to claim 

20 wherein said title evaluation point determination unit further comprises a natural 
language analysis unit for matching said characters with a set of predetermined words, said 
predetermined words indicating said title. 



10 24. The system for determining a title from a document image according to claim 

23 wherein said natural language analysis unit generates a result of matching of said 
characters with said predetermined words. 

^^L^ ^25. The system for determining a title from a document image according to claim 
15 20 wherein said character recognition unit generates said information on a number of said 
characters. 



26. The system for determining a title from a document image according to claim 
25 wherein said title evaluation point determination unit compares said number of said 
2 0 characters to a predetermined maximal threshold number. 

$>C\ % **y27. The system for detenrftning a title from a document image according to claim 
20 wherein said character recognitiolj unit generates said information on an assurance level 
of said character recognition. 
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28. The system for determining a title from a document image according to claim 
27 wherein said title evaluation point determination unit compares said assurance level to a 
predetermined minimal threshold value. 
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29. The system for determining a title from a document image according to claim 
20 wherein said title evaluation point determination unit further comprises a characteristics 
extraction unit for extracting layout characteristics. 

5 ^£f&~*730. The system for determining a title from a document image according to claim 
29 wherein said extraction unit extracts said layout characteristics on centering, underlining 
and size. 

3 1 . The system for ^determining a title from a document image according to claim 
10 23 wherein said natural language analysis unit generates said information indicating 

whether or not said characters Aid in a noun form. 

32. The system for determining a title from a document image according to claim 
23 wherein said natural language analysis unit generates said information indicating 

1 5 whether or not said characters endlin a set of predetermined suffixes. 

33. The system for determining a title from a document image according to claim 
2 wherein said character row area determination unit generates said information on a ratio 
between a length and a height of eaih of said circumscribing rectangles. 
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34. The system for determining a title from a document image according to claim 
2 wherein said character row area determination unit generates said information on a ratio 
between a summed width of said characters and a corresponding one of said circumscribing 
rectangles. 

35. The system for determining a title from a document image according to claim 
19 wherein said likelihood is adjusted according to a type of said image documents. 



30 



36. The system for determining a title from a document image according to claim 
19 wherein said title is combined with a keyword. 
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